High stakes testing in higher education and employment: appraising the evidence for validity and fairness.
نویسندگان
چکیده
The authors review criticisms commonly leveled against cognitively loaded tests used for employment and higher education admissions decisions, with a focus on large-scale databases and meta-analytic evidence. They conclude that (a) tests of developed abilities are generally valid for their intended uses in predicting a wide variety of aspects of short-term and long-term academic and job performance, (b) validity is not an artifact of socioeconomic status, (c) coaching is not a major determinant of test performance, (d) tests do not generally exhibit bias by underpredicting the performance of minority group members, and (e) test-taking motivational mechanisms are not major determinants of test performance in these high-stakes settings.
منابع مشابه
Interpreting the Validity of a High-Stakes Test in Light of the Argument-Based Framework: Implications for Test Improvement
The validity of large-scale assessments may be compromised, partly due to their content inappropriateness or construct underrepresentation. Few validity studies have focused on such assessments within an argument-based framework. This study analyzed the domain description and evaluation inference of the Ph.D. Entrance Exam of ELT (PEEE) sat by Ph.D. examinees (n = 999) in 2014 in Iran....
متن کاملDifferential Item Functioning (DIF) in Terms of Gender in the Reading Comprehension Subtest of a High-Stakes Test
Validation is an important enterprise especially when a test is a high stakes one. Demographic variables like gender and field of study can affect test results and interpretations. Differential Item Functioning (DIF) is a way to make sure that a test does not favor one group of test takers over the others. This study investigated DIF in terms of gender in the reading comprehension subtest (35 i...
متن کاملA Review of Internet-Centered Language Assessment: Origins, Challenges, and Perspectives
This article defines the origin of an internet-centered language assessment (ICLA), how ICLAs are different from the other traditional computer-oriented tests, and what uses and functions ICLAs have in different taxonomies of language testing. After a very short review of computer- oriented testing, ICLAs are defined and categorized in low-tech or high tech categories. Since low-tech tests are ...
متن کاملValidity of the Iranian Version of Health Utility Index Mark 3 Quality of Life Questionnaire
Background: The aim of this study was to standardize and develop the health utility index III (HUI3); quality of life questionnaire. This study was conducted for the first time in Iran. Method: Forward-backward translation method was applied in order to translate the Canadian version into Persian. The final version was developed after modifica...
متن کاملAssessing Assessment Literacy: Insights From a High-Stakes Test
This study constitutes an attempt to see what Language assessment literacy (LAL) isfor three groups of stakeholders, namely LAL test developers, LAL instructors, andLAL test-takers. The perceptions of the former group were derived from the contentanalysis of the latest version of the LAL test, and those of the latter 2 groups wereassessed through a survey designed by the researcher. Participant...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- The American psychologist
دوره 63 4 شماره
صفحات -
تاریخ انتشار 2008